Generating Ground Truthed Dataset: Automatic or Semi-automatic?
نویسندگان
چکیده
Ground truthing tools mainly fall into two categories: automatic and semi-automatic. In this paper, we first discuss the pros and cons of the two approaches. We then report our own work on designing and implementing systems for generating chart image dataset and multilevel ground truth data. Both semi-automatic and automatic approaches were adopted, resulting in two independent systems. The dataset as well as the ground truth data are publicly available so that other researchers can access them for evaluating and comparing performances of different systems.
منابع مشابه
Mapping Transcripts to Handwritten Text
In the analysis and recognition of handwriting, a useful first task is to assign ground truth for words in the writing. Such an assignment is useful for various subsequent machine learning tasks for performing automatic recognition, writer verification, etc. Since automatic word segmentation and recognition can be error prone, an intermediate approach is to use a text file that is a transcripti...
متن کاملA Framework for Evaluating Underwater Mine Detection and Classification Algorithms Using Augmented Reality
This paper presents a novel framework for evaluating Target Detection and Classification algorithms and concepts of operations based on Augmented Reality (AR). Real sonar images and synthetic target models are used to generate a ground-truthed AR theatre of operation. The detection/classification results of the human operator or Automatic Target Recognition (ATR) algorithm to be evaluated are t...
متن کاملAutomatic Prostate Cancer Segmentation Using Kinetic Analysis in Dynamic Contrast-Enhanced MRI
Background: Dynamic contrast enhanced magnetic resonance imaging (DCE-MRI) provides functional information on the microcirculation in tissues by analyzing the enhancement kinetics which can be used as biomarkers for prostate lesions detection and characterization.Objective: The purpose of this study is to investigate spatiotemporal patterns of tumors by extracting semi-quantitative as well as w...
متن کاملThe BEHAVE video dataset: ground truthed video for multi-person behavior classification
Although there is much research on behaviour recognition in time-varying video, there are few ground truthed datasets for assessing multi-person behavioral interactions. This short paper presents the BEHAVE project’s dataset, which has around 90,000 frames of humans identified by bounding boxes, with interacting groups classified into one of 5 different behaviors. An example of its use is also ...
متن کامل